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Abstract. Ergodic Optimization is the process of finding invariant probability measures 
that maximize the integral of a given function. It has been conjectured that "most" func- 
tions are optimized by measures supported on a periodic orbit, and it has been proved in 
several separable spaces that an open and dense subset of functions is optimized by measures 
supported on a periodic orbit. All known positive results have been for separable spaces. 
We give in this paper the first positive result for a non-separable space, the space of super- 
continuous functions on the full shift, where the set of functions optimized by periodic orbit 
measures contains an open dense subset. 



1. Introduction 

Given an expansive map T : Q — > Q and a continuous function /, we say that a T-invariant 
probability measure u optimizes f if 



fdfi> J fdu 

for all T-invariant probability measures v. If y is a periodic point (i.e., T l y = y for some i), 
let fiy be the unique T-invariant probability measure supported on Oy, the orbit of y. We 
call fiy a periodic orbit measure. If fi y optimizes /, we will also say that / is optimized by 
the periodic point y. 

General Belief. "Most" functions are optimized by measures supported on a periodic orbit. 

"Most" can take various meanings, but for our purposes, we consider "most" to be an 
open dense set or a residual set. 

Conjecture 1. In an expansive dynamical system, the set of Lipschitz functions optimized by 
periodic orbit measures contains an open set that is dense in the class of Lipschitz functions. 

Analogs to Conjecture [1] have been shown false in the general case of continuous functions 
[6], however they have been shown true in a handful of separable spaces. Further, various 
numerical experiments on many important dynamical systems support this conjecture (and 
hint towards some very interesting relationships between parameterized families of functions 
and the period of optimizing orbits) jU EJ [S] . 

We present a non-separable space where the analog of Conjecture [T]holds true. Let Q = A N 
be the one-sided shift space on a finite alphabet. For a sequence A n \ 0, define a metric 
Aa{x, y) = A n if x and y first differ in the nth place (i.e. (x)i = (y)i for < i < n; 
(x) n ^ (y) n )- Let Ca(Q) denote the set of Lipschitz functions with respect to the g?a 
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metric, equipped with the (i^-Lipschitz norm. If {A n } satisfies the additional property that 
A n+ i/A n — > 0, we call / G Ca(£1) super continuous. 

Theorem 2. Suppose A = {A n } and A n+ i/A n — > 0. For a periodic orbit measure \x y 
supported on Oy, let P y = {/ G Ca(Q) : \i y is the unique maximizing measure}. Then, 
Uy periodicity) ^ s dense ^ n a ^ ofCj^l) under the A-norm topology (where (P y )° is the interior 

0fPy). ' 

We will briefly survey the most well-known positive results. A function / is a Walters 
function (introduced by Walters in |7|) if for every e > there exists a 6 > so that for all 
iiGN and x and y, 

max{d(Tx,Ty)}<5 \S n f(x)-S n f(y)\<e, 

0<i<n 

where S n f(w) = Y^7=o f{T % w). Bousch shows for Walters functions, the analog of Conjecture 
□ holds 0. 

Contreras, Lopes, and Thieullen showed in [3] that when using a Holder norm external to 
a particular union of Holder spaces, the analog of Conjecture [T]for Holder spaces holds. Yuan 
and Hunt made significant progress towards proving Conjecture [H though the full result has 
not yet been proved. 

Presented are the already-established theorems for comparison. Note that although the 
theorems are stated in a variety of contexts (expanding maps of the circle, one-sided shifts 
etc.), the essence of the problem is present in the simple setting of the one-sided shift. 

Theorem (Bousch j2]). Let T : X — >■ X be the one-sided shift map and let W denote the set 
of Walters functions on X . If P dW is the set of Walters functions optimized by measures 
supported on periodic points, then P contains an open set dense in W with respect to the 
Walters norm. 

Theorem (Contreras-Lopes- Thieullen |3j). Let T be a C 1+a expanding map of the circle. 
Let Hp be the set of f3 -Holder functions on S 1 and let F a+ = [J /3>a Hi 3 . Let P a+ C J^ a+ be 
the subset of functions uniquely optimized by measures supported on a periodic point. Then 
P a+ contains a set that is open and dense in J-" a+ under the H a topology (i.e., the a-Holder 
norm). 

Theorem (Yuan and Hunt [9]). Let T : M — > M be an Axiom A map or an expanding 
map from a manifold to itself and let Cu P denote the class of Lipschitz continuous functions. 
For any f G Cu P optimized by a measure generated by an aperiodic point, there exists an 
arbitrarily small perturbation of f such that that measure is no longer an optimizing measure. 
Further, any f G Cu P optimized by a periodic orbit measure can be perturbed to be stably 
optimized by this periodic orbit measure. 

With the inclusion of this paper, the current state of the standing conjecture is some- 
what curious. Notice that super-continuous functions are Lipschitz functions and Lipschitz 
functions are Walters functions. So, for both a larger and a smaller class than Lipschitz 
functions, analogs of Conjectured] have shown to be true, and yet proof of the Lipschitz case 
remains elusive. 

1.1. Notation & Definitions. For some finite alphabet A, let Q = be the space of 
one-sided infinite sequences on A. For us N includes 0. 
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T : Q — > Q is the usual shift operator, with T-invariant Borel probability measures on Q 
denoted Ai. We write Ox for the orbit of x under T, and we say S is a segment of Ox if it 
is an ordered list of the form (T l x,T l+1 x, . . . ) T l+p ~ l x) for some i,p. Abusing notation, we 
may say S C Ox. 

We use d to denote the standard metric on sequences. That is, d(x, y) = 2~ k where 
k = inf{i : (x)i ^ (y)i} and (z)i is the ith symbol of z. We follow the convention that 
2-°° = 0. 

Definition 3 (Shadowing). For two points x,y, we say that x e-shadows a segment S = 
(T m y,...,T m+n - l y)cOy if 

d(T l x, T i+m y) < e 

for all < i < n. 

Definition 4 (e-close). A point x is said to stay e-close to a set Y for p steps if for all 
<i <p, 

d{T l x,Y) < e. 

Notation 5 (Ergodic Average). For a function f and a point x, 

1 N ^ 

</>(*)= Mm t^E^)' 

i=0 

when the limit exists. 

Notation 6. If x — a a\a2 ■ ■ ■ is a point, 

(x)l = aia i+ i ■ ■ ■ dj-xdj 
is the subword of x from position i to j . 

2. Summable Variation 

Definition 7 (Variation). The variation of a function over level k cylinder sets is the max- 
imum a function changes in a distance of2~ k . That is, if f is a function 

vax fc (/) = sup{|/(a;) - f(y)\ : d(x,y) < 2~ k }. 

Note that in a shift space, we have additional structure because distances can only take 
values of the form 2~ k . 

Definition 8 (Summable Variation). The function f is of summable variation if 

oo 

J^var fe (/) < oo. 

Notation 9. Vk(f) represents the tail sum of the variation of f over distances smaller than 
2- k+1 . That is 

oo 

V fe (/) = ]Tvar,(/). 

j=k 
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Functions of summable variation form a much larger class than Lipschitz functions. How- 
ever the general method used in this paper to show Theorem [2] is to perturb functions by 
a small multiple of some canonical "sharpest" function. Yuan and Hunt used this strategy 
when dealing with Lipschitz functions by perturbing by —d(x,Oy) [9J. But, for functions 
of summable variation (with the natural norm of ||/|| = V (f) + ||/||oo), there is no such 
"sharpest" function. Using the A-norms gives us these sharpest functions again. 

We will frequently refer to A-metrics and A-norms as briefly introduced earlier. 

Definition 10 ( A-sequence) . An A-sequence, (A n )'^ =0 , is a decreasing sequence of positive 
numbers with A n — > 0. 

// there exists < S < 1 such that A n+ i/A n < 1 — 5 for each n, then we say that (A n ) is 
lacunary. 

Recall that the metric d& is defined by = A n if = for < i < n but 

0)n ^ (y)n- 

Definition 11 (A- norm). If (A n ) is an A-sequence, the Lipschitz constant of f is Lip A (/) = 
sup fc Y&r k {f) / A k . The A-norra is defined by \\f\\A = Lip A (/) + 



Of course if A is the sequence (2~ n )^L , we recover the standard distance and Lipschitz 
norm. We write the set of Lipschitz functions with respect to dA as Ca(Q) or simply Ca- 

Notice that since A satisfies A n — > 0, Ca(Q) C C(Q) is a subset of the continuous functions 
on Q. Further, Ca is a non-separable Banach space as the functions f x {-) = d(x, •) for x G f2 
are an uncountable uniformly discrete set. 

3. Preliminary lemmas 

We will first establish several results that do not depend on super continuity. 

Definition 12 (In Order for One Step). For points x, y, let S = (T^y, T^ +1 y, . . . , T^ +k y) C 
Oy, and suppose that there is a unique closest point y' G S to x. That is, 

d(x,y')<d(x,S\{y'}). 

We say that x follows S in order for one step ifTy' G S and Ty' is the unique closest point 
to Tx. That is Ty' G S and 

d(Tx,Ty')<d(Tx,S\{Ty'}). 

Definition 13 (In Order). For some point y, let S = (T J y,T J+1 ?/, . . . , T^ +k y) C Oy. For 
some point x, we say that x follows S in order for p steps if x, Tx, . . . , T v ~ x x each follow S 
in order for one step. 

Following in order is very similar to the concept of shadowing except that the distance 
requirement in shadowing is replaced by a uniqueness requirement. The following In Order 
Lemma is due to Yuan and Hunt [PJ. 

Lemma 14 (In Order Lemma) . Let y be a periodic point of period p, and let 

p< min d^Ty^y)/^. 

0<i<j<p 

For any point x, if x stays p-close to Oy for k + 1 steps, then x follows Oy in order for k 
steps. That is, there exists some %' such that for < j < k, 

d{T j x, T i,+1 y) < p. 
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Proof. Let 7 = mm <i<j< p d(T l y, T*y). We first derive a fact about the shift space due to 
its ultrametric properties. Suppose y', y" G Oy and for some point x, d(x, y'), d(x, y") < 7/2. 
By the ultrametric triangle inequality we have 

(1) d(y', y") < max(d(x, y'), d(x, y")) < 7 /2. 

Since 7 was the smallest distance between points in Oy, equation ([I]) gives y' = y" . This 
shows that for any point x, if d(x, Oy) < 7 /2, then there is a unique closest point in Oy to 
x. 

Let x be a point that stays p-close to Oy for k + 1 steps. By definition, we have 

d{x,Oy) <p< 7 /4. 

Since 7 is the minimum distance between points in Oy, there is a unique i' such that 

d(x,T i 'y)<p. 

We then have that 

d{Tx,T l ' +l y)<2p< 1 /2, 

and so T l ' +l y is the unique closest point to Tx. Thus, x follows Oy in order for one step. 
But, by assumption we have d(Tx, Oy) < p, so d(Tx, Oy) = d(Tx,T % +1 y) gives us that Tx 
follows Oy in order for one step and so x follows Oy in order for two steps. Continuing by 
induction, we see that x follows Oy in order for k steps; that is 

d(T j x, T i,+j y) <p for < j < k. 

■ 

Lemma 15 (Shadowing Lemma). For a point y, let S = (T l y,T t+l y, . . . ,T t+k ~ 1 y) be a 
segment of Oy. For any p < 1, if a point x p-shadows S for k steps, the distance from T^x 
to S for < j < k is bounded by 

d{T j x,T i+j y) < p2- ( ^- 1) - j) . 

Proof. Let / = inf{u> : 2~ w < p} and note p < 1 implies I > 1. Since x p-shadows S for k 
steps, we have {T^x) l l = (T^y) 1 ^ 1 for < j < k - 1, and so (x)£ +/ ~ 2 = (T l y) k +l - 2 , which 
gives the result. ■ 

Lemma 16 (Parallel Orbit Lemma). For a function of summable variation f, ifT m x 2~ r - 
shadows Oy for k steps (i.e., there exists i so d{T m+ i x,T l+: >y) < 2~ r for < j < k), then 
for r > 0, 

k-l 

^\f(T m+ ix)-f(T i+ iy)\<V r (f). 

j=0 

Proof. Suppose x, y are points such that d{T mJr ^x, T %+ ^y) < 2~ r where r > 1 for < j < k. 
The Shadowing Lemma (Lemma [T5|) gives us that 

d(T m+j x,T i+j y) < 2-('-+(^ 1 )-^. 

We then have 

k-l r+k-l 

E l/( Tm+ ^) - f(T i+j y)\ < E w/) ^ v ^)- 
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4. Mane-Conze-Guivarc'h normal form and main result 

Heuristically, let us consider the following: Suppose / is optimized by /i max and J fdfi max = 
0. We will define a function /* to represent the "payoff of going backwards to infinity" Before 
we describe what /* means, let us consider the payoff of going backwards a finite number 
of steps. For a point x, there is some point a\x G T~ x x such that f{a\x) > f(pix) for any 
symbol b%. In other words, maximal one-step backwards extension of x. Continuing, 

there is some point a\a\x G T~ 2 x so that f(a\a\x) + f(a\) > ffobxx) + f{bix) for any word 
&2&1, making maximal two-step backwards extension of x. It is important to note 

that the symbol a\ need not be the same as the symbol a\, and so it is in no way immediate 
that there should be some convergent way to pick an infinite maximal backwards extension 
of x. 

However, ignoring these issues for the moment, one can imagine that n-step backwards 
extensions of x look more and more like generic points of /i max (if /i ma x is a periodic orbit 
measure, this should be especially plausible). We now informally define /* as 

/* (a;) = f{afx) + f(a?a?x) + f{afafafx) + ■ • • , 

where • • ■ afa^afx is an infinite maximal backwards extension of x. Since J /d/i max = 0, 
it is reasonable to expect that if /* converges, it is bounded above. Ignoring any issues of 
convergence, consider 

f* o T - /*. 

Suppose x = XqXi ■ ■ ■ is a point with maximal backwards extension • • ■ a 2 aiXQXi ■ ■ ■ . We 
immediately see (/* o T — f*)(x) > f{x), since either the maximal backwards extension of 
Tx = X\X2 ■ ■ ■ is • • • a<ia\XQX\ ■ ■ ■ , which would give us (/* o T — f*)(x) = f(x), or there is an 
alternative backwards extension of Tx that yields a bigger payoff than • • • CL2CL1XQX1 ■ ■ ■ and 
so(f*oT-f*)(x)>f(x). 

Since f*oT — f* is a co-boundary (a function of the form h — hoT) and so integrates to zero 
with respect to any invariant measure, the function / = / — (/* oT — /*) is co-homologous to 
/ (and so J fdfi = J fdfi for all invariant measures /x), with the added property that / < 0. 

The Mane-Conze-Guivarc'h procedure is a way of producing a well defined /*. We use a 
method due to Bousch p], which produces /* as a fixed point of an operator that reflects 
the idea of a maximal backwards extension. 

For / G Ca, define the operator <£j : Ca — > Ca by 

(® f g){x) = max {(f + g)(y)}. 

Proposition 17 (Bousch). Let (A n ) be a lacunary A-sequence. For a fixed function f G Ca 
with sup^g^ J fdfi = 0, the operator as defined above has a fixed point. 

The proof follows standard lines with minor adaptations for the case of /L-norms rather 
than Lipschitz norms. We briefly summarize the steps, referring the reader to Bousch [1] for 
more details. 

Proof sketch. Let A n+ \/A n < 1 — 8 for all n (where < 8 < 1). We claim that $/ maps 
C = {g: Lip^g) < Lip A (f)/S} into itself. We do part of this step in detail since we need 
a fact from it later. Let g G C and let x and x' differ first in their (n — l)st coordinates. 
Using the notation ix to denote the sequence with its first symbol defined by (ix) = i and 
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all remaining symbols defined by (ix)k+i = %k, we have 

®f(g)( x ) ~ ®f(g)( x ') = max(/(zx) + g(ix)) - max(f(jx') + g{jx')) 

» 

< max(/(zx) + g(ix) — f(ix') — g{ix')) 

i 

< var n (/) + var n (#) 

By symmetry we deduce 

(2) var n _i < var n (/) + var n (#). 

Straightforward manipulation then shows that $/(<?) G C. 

Taking a quotient of C by the relation ~ where two functions g and g' are related if they 
differ by a constant, one obtains a compact (with respect to the quotient of the supremum 
norm topology) convex set Cj ~ on which acts continuously. Hence, there is a fixed 
point. This fixed point corresponds to a function h G C such that &f(h) — h + (3 for some 
constant (3. One then shows that sup^g^ f f dfi = implies /3 = ■ 

Theorem 18. Let (Ai) fre a lacunary A- sequence. There exists a constant ^ A > 1, dependent 
only on the choice of A-sequence, such that for all f G C A with sup^g^ J fdfi = 0, there 

exists a co-homologous function f with f < and 

||/IU<7a||/IU v„/<7a||/IUAi. 

Proof. Suppose A n+ i/A n < 1 — 5 for all n (for some < 5 < 1). By Proposition [TTJ, we 
may find h, a fixed point of $/ with 

FlU < ^ h \\h\\oo < {A + 1)— — . 

However, from fl2l) we have 



This gives 



var n _i(/i) = var n _ 1 ($//i) < var„(/) + var n (7i). 



var n (/i o T) var»_i(/i) vax n (/) + var n (/i) 



4 ~~ A ~ A 

and so \\ho T\\ A < ll/IU + II^IU- Let f = f + h — hoT. f has the desired properties and 

2(A + 1 + 6) 



A 



< f\\A+ h\\ A + hoT\\ A < 2\\f\\ A + 2\\h\\ A < 



5 

Let us now focus on finding a constant such that V n f < K\\f\\ A A n . From our bound on 

k>n 



A) we know var fe / < 2(Ai t 1+l5) ||/||^A fe . A k+ i/A k < 1 - 5 for all fc gives that ^) fc>n A fc < 



Ai/<S and so 



v j < 2 < a °; 1 + ^ ii/iua„ 



Letting 7^ = 2(A + 1 + 5)/5 2 completes the proof. ■ 

It should be noted that Theorem [TBI can trivially be applied to functions / where sup^g^ J fdfj, 

(3 ^ by letting / = + (3. 

Corollary 19. Theorem [XH holds with the weakened assumption that limsup A n+ i/A n < 1. 



s 
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Proof. Since limsup A n+ i/A n < 1, we can construct a sequence B n such that B n+ i/B n < 
1 — 5 for some < 5 < 1 and Bi = Ai for i > N for some finite N. Since we only changed a 
finite number of terms of A to produce B, \\ ■ \\a and || • \\b are equivalent. Let M be such 
that \\f\\ A < M\\f\\ B for all f e C A and M' = maxA n /B n . Letting 7 A = MM'^ B completes 
the proof. ■ 

Though not dependent on Theorem [18] it is convenient to note that ja from Theorem [18] 
also bounds V n f in the expected way. 

Fact 20. If (A n ) is a lacunary A-sequence, then for f G Ca 

Vnf < lAWfUAn, 

where 7^4 is as in Theorem [ 



We now have machinery in place to give a quick proof of Proposition [21], which establishes 
a relationship between the number of points in the support of a periodic orbit measure and 
how close such measures come to optimizing a fixed function. This result was first established 
by Yuan and Hunt (without using the Mane-Conze-Guivarc'h Lemma) in [9J for Lipschitz 
functions. 

Proposition 21 (Yuan and Hunt). Let (A n ) be a lacunary A-sequence. Let f G Ca and x 

be an optimal orbit for f (i.e., a typical point of a maximizing measure). Let y be a point 
of period p, and r > 0. If a segment of Ox 2~ r -shadows Oy for one period (i.e., there exist 
m,m' such that d(T i+m x } T i+m ' y) < 2~ r for0<i<p), then 

(/) (x)-i A \\f\\AA r /p<(f) (y)<(f) (x), 

where ja is as in Theorem Q3 

Proof. Let y be a period p point with the property that a segment of Ox 2 _r -shadows Oy 
for p steps. By renaming some T^y as y, without loss of generality we may assume that a 
segment of Ox 2 _r -shadows y. That is, there exists some m so that d(T m+l x, T l y) < 2~ r for 
< i < p. Let x' = T m x. 

By Theorem!!!], we may find / co-homologous to / with f(Ox) = f(Ox') = (f) (x). Since 
for < i < p we have 

d^x'.Ty) < 2-( r +(p- 1 -' J )) ) 
we may apply the Parallel Orbit Lemma (|T6|) to get 

E (/(TV) - /(TV)) = (E /(TV) ) - p (/) (y) 



< Vrf- 



The proposition follows from the fact that f(T l x') = (/) (x) and that by Theorem [TS]V r / < 
7J/IUA" ■ 

Using methods similar to those in Yuan and Hunt[9J, one can show that Proposition |2~T1 
holds for any function / of summable variation, and one can produce a slightly stronger 
bound of (f)(x)-4V r f/p<(f)(y)<(f)(x). 

We are now ready to prove Theorem [2] by using d A (-,Oy) as a "sharpest" function that 
will penalize any measure that gives mass to (Oy) c . 

Theorem (Theorem [2]). Let (A n ) be an A-sequence satisfying A n+1 /A n — > 0. For a periodic 
orbit measure fi y supported onOy, let P y = {/ G Ca(O) : fi y is the unique maximizing measure}. 
Then, (j y periodic (P y )° is dense in Ca(Q) (where (P y )° is the interior of P y ). 
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Proof. We will show that for any function /, there exists an arbitrarily small perturbation, 
/, of / and a periodic orbit measure p, y , such that all functions in an open neighbourhood 
of / are uniquely optimized by /i r 

Since liminf A n+ i/A n = 0, by Corollary [191 passing to an equivalent norm if necessary, we 
may assume A n+ i/A n < 1/2 for all n. Fix / G Ca and let /i max be an optimizing measure 
for /. Fix x e supp(/i max ). Without loss of generality, assume (/) (x) = and let / be 
co-homologous to / with / < 0. 

Suppose we showed that an arbitrarily small perturbation / + g of / were such that the 
open ball of radius e about f + g is uniquely optimized by a periodic orbit measure fjL y . Since 
/ and / are co-homologous, this means that / + g is uniquely optimized by \i y and in fact 
the open ball of radius e about / + g is uniquely optimized by \i y . Thus, it is sufficient to 
only consider small perturbations of /. 

Fix < e < 1. For a fixed k (to be determined later), find a minimal recurrence in x of a 
block of k symbols. That is, find % < j such that d(T i x, T^x) < 2~ k but for % < i' < f < j, 
we have d(T l x, T J x) > 2~ k . Notice that such a minimal recurrence exists for all k by the 
pigeonhole principle. 

Let p = j — i and let y be the point of period p satisfying (y)]" 1 = {x){~ ■ Since 
d{T l x,T^x) < 2~ k we see that (y){ +k " 1 = (x){^ 1 - It follows that the orbit segment 
(T*x, . . . , Ti~ l x) 2~( fc+1 )-shadows T*y. 

Let 2~ l = mmi<ii < ji < j{d(T' 1 y,T^ y)} be the minimum distance between points in Oy and 
notice that by construction of y and the ultrametric property, 2~ l > 2~( k ~ 1 \ 

Define the perturbation function g by g(t) = — ^(t, Oy), and let / = / — eg. 

We will now show that provided k is sufficiently large, the measure supported on Oy is 
the unique optimizing measure for functions lying in a || • ||^-open ball about /. 

Let Q = {/ + h : \\h\\A < ea} with a < 1 to be determined later. Fix / — eg + h e Q and 
let q be its normalization, q — f — eg + h + (3 where (3 = — sup^g^ j(f — eg + h)dp. 

Let 7^ be as in Theorem [TS1 Recall that 7^4 > 1. We then have V n f < TaH/HaAi- 
Further, since e, a < 1, Fact [201 gives us V n (eg),V n h < ^aAu. Let L = 7a(||/IU + 2)- Since 
V n (/ — £5 + h) = V n q we have 

V n /, V n /, V n g < LA„ and 7A V n / < LA n , 

with the second inequality following from Fact[20j Further, L only depends on A and ||/||a- 
Since x 2 _ ( fe+1 )-shadows Oy for p steps, we can get a good bound for f3. By construction 

(?) (</) = (/> (y) - ^ («/> (y) + (h) (y) + /3< 0, 

and so 

0<-(f) (y) + e (#> (y) - W (y) = - (/) (y) - W (y). 

Proposition[2I]gives us (/) (x)-j A Vk+i(f)/p = -lA^k+i{f)/p < (f) (y) so that - (/) (y) < 
LAfc + i/p. Combining this with the fact that \\h\\oo < ||/i|U < £0" gives /3 < LA^+i/p + ecr. 
Since q = f — eg + h + (3 and the first two terms are non-positive, we see that 

h(u) + /3 < + 2eo for all u G fi; and 

(3) ^ 

g(o;) < h 2£cr for all u E fl. 

p 
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Let be the co-cycle q^(z) = q(T n ~ 1 z) + q(T n ~ 2 z) + ■ ■ - + q(z), and note that if n > m, 
q(ri(z) - g (m) (z) = q( n - m \T m z). 

We know by Proposition [T7]that there exists q*, a fixed point of $ g . Let z e Vt be arbitrary. 
We know there exists some symbol a\ such that q*(z) = q(a±z) + q*{a\z). Iterating this 
process, we may find an infinite sequence of preimages (a*) such that for any n > 0, 

q*(z) = q(a 1 z) + q(a 2 a 1 z) H h q(a n ■ ■ ■ a\z) + q*(a n ■ ■ ■ a\z) 

(4) , , 

= g w (a n • • • a x z) + q* (a n ■ ■ ■ a x z) . 

Fix any such preimage infinite sequence (aj). We will now identify a (possibly finite) 
sequence of times, (£„), by the following recursive procedure: For a time t, define co t = 
Let t be the smallest number (if it exists) such that d(u to ,Oy) > 2 (fc+1) . 
Given t n , let t n+ \ > t n be the next smallest number (again, if it exists) so that d(ut n+1 , Oy) > 
2~( fc+1 ). Our goal is to show that the length of the sequence is finite. From this it follows 
that the preimages u t accumulate to Oy. It will then follow that the periodic orbit measure 
supported on Oy is the unique maximizing measure. 

Since 2~ l > 2~ ( - k ~ 1 * ) (and so 2~'/4 > 2~( fe+1 )), for times strictly between t n and the 
In Order Lemma (Lemma UM gives that we 2~^ +1 ^-shadow Oy. 

Suppose t n — £ n _i > 1 and let y' G Oy be the point that is 2~( /c+1 )-shadowed by u t „ for 
t n — — 1 steps (that is d(T l Lu t7l ,T l y) < 2~( fc+1 ) for < i < t n — t„_i). Summing along 
this segment, the Parallel Orbit Lemma (Lemma [T6j) gives us 

[(ZCTHJ - q(Ty')] < V k+1 (q) < LA k+1 . 



0<i<t n —t n -i 



so that 



q{T i u tn )<LA k+l + Yl ^ T V) 



0<i<t n — tn-l 0<i<tn—tn-l 

Grouping Ylo<i<t -t _ q(T l y') in blocks of length p together with at most p — 1 singleton 
terms and using (j3J), we see 

£ q(T l u tn ) < LA k+ i + mp (q) (y) + (p - 1) (LA k+1 /p + 2ea) , 

0<i<t„-t n -i 

where m is the integer part of (t n — £ n _i — I) /p. Since (q) (y) < 0, we simplify to get 

(5) Yl iC^O < 2LA k+1 + 2(p - l)ea. 

0<i<t n —t„-l 

Notice that this equation holds also (trivially) if t n = t n -\ + 1. We now evaluate q{oj tn ): 

q(ut n ) = /(c«0 - eg{u tn ) + h{u tn ) + (3. 

By construction we have d(co tn , Oy) > 2~ k so that g(co tn ) > A k . Using fl3]) again and the fact 
that / < we have 

(6) q(u tn )<-eA k + ^^ + 2ea. 

p 

Combining equations (jSJ) and (jf)]) we get 

g (*»-*n-0( Wtn ) < - eAk + 3LA k+1 + 2pea, 
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and so for a < A k j (4p) we have 

g <*»-*»-0( WtB ) < -i Ak + 3LA k+l . 

Since L only depends on (A n ) and ||/||a, our assumption that A k+ i/A k — > ensures that 
there exists a A; such that a = \A k — 3LA k+ i > 0. Fix this k and fix o = A k /(4p). Let 
(a;)^ -1 be the minimal recurrence segment identified in the proof and y be the corresponding 
periodic orbit. This fixes the open ball Q whose centre is at a distance e from /. 

We have shown that for any function in Q, its normalized version q satisfies q^^-^iu^) < 
—a. Expanding using (j3J) now gives 

n 

g*K) - q*(u tn ) = q {tn - t0 \u tn ) = J^^'^M < -no. 

i=l 

But q* is a bounded function and so the number of terms in the sequence (t n ) is finite. 

Since z was chosen arbitrarily, this is sufficient to show the periodic orbit measure sup- 
ported on Oy uniquely optimizes q. If not, then there would be points z and preimage 
sequences (<2j) satisfying (TJJ that do not eventually follow Oy, and so (t n ) would be infinite. 
■ 

Theorem [2] proves both (a) that a function optimized by an aperiodic point can be per- 
turbed to be optimized by a periodic point and (b) that a function optimized by periodic 
point can be perturbed to lie in an open set of functions optimized by the same periodic 
point. Following the methods of Yuan and Hunt in |9j, one can prove (b) in the general 
context of A- norm spaces (dropping the assumption that A n+1 /A n — > entirely). 
Acknowledgments. We would like to thank the referee for a careful reading and very 
useful suggestions. 
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